Skip to content

[SYSTEMDS-3168] Matrix Multiplication Transposed Kernels#2425

Closed
Elmanjhg wants to merge 1 commit intoapache:mainfrom
Elmanjhg:SystemDS-3168-MatrixMultiplicationTransposedKernels
Closed

[SYSTEMDS-3168] Matrix Multiplication Transposed Kernels#2425
Elmanjhg wants to merge 1 commit intoapache:mainfrom
Elmanjhg:SystemDS-3168-MatrixMultiplicationTransposedKernels

Conversation

@Elmanjhg
Copy link
Copy Markdown
Contributor

Optimize dense matrix mult for transposed inputs

This introduces specialized kernels for dense matrix multiplication involving transposed inputs (t(A)%%B, A%%t(B), t(A)%*%t(B)). Previously, these operations required an explicit intermediate transpose step, which caused unnecessary runtime.

The new kernels perform the operations in-place or using tiled-transposition, avoiding the full allocation cost.

Performance benchmarks on 100x100 dense matrices show significant speedups especially for t(A)%%B and t(A)%%t(B) and can be tested with higher dimensions.

Optimize dense matrix mult for transposed inputs

This introduces specialized kernels for dense matrix multiplication
involving transposed inputs (t(A)%*%B, A%*%t(B), t(A)%*%t(B)).
Previously, these operations required an explicit intermediate transpose
step, which caused unnecessary runtime.

The new kernels perform the operations in-place or using tiled-transposition,
avoiding the full allocation cost.

Performance benchmarks on 100x100 dense matrices show significant speedups especially for t(A)%*%B
and t(A)%*%t(B) and can be tested with higher dimensions.
@anusha975
Copy link
Copy Markdown

Hi! I’d like to work on this issue. I’ve reviewed the current code and have a plan to implement the fix/feature. Thank you!

@mboehm7
Copy link
Copy Markdown
Contributor

mboehm7 commented Mar 29, 2026

Thanks for the great contribution @Elmanjhg. During the merge, I only fixed the wildcard import.

@mboehm7 mboehm7 closed this in ea00f88 Mar 29, 2026
@github-project-automation github-project-automation bot moved this from In Progress to Done in SystemDS PR Queue Mar 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants